The Annals of Statistics 

2012, Vol. 40, No. 3, 1578-1608 

DOI: 10.1214/12-AOS1015 

@ Institute of Mathematical Statistics, 2012 



THE LIMIT DISTRIBUTION OF THE Xoo-ERROR OF 
GRENANDER-TYPE ESTIMATORS 

By Cecile Durot, Vladimir N. Kulikov and Hendrik P. Lopuhaa 

_ . University of Nanterre, ASR Nederland and Delft University of Technology 

^ ■ Let / be a nonincreasing function defined on [0,1]. Under stan- 

dard regularity conditions, we derive tiie asymptotic distribution of 
tfie supremum norm of the difference between / and its Grenander- 
type estimator on sub-intervals of [0, 1] . Tfie rate of convergence is 
found to be of order {n/ \ogn)~^^^ and tlie limiting distribution to 
be Gumbel. 

1. Introduction. After the derivation of the nonparametric maximum 
Ukelihood estimator (NPMLE) of a monotone density and a monotone fail- 
ure rate by Grenander [10], and the least squares estimator of a monotone 
regression function by Brunk [4], it has taken some time before the distribu- 
tion theory for such estimators entered the literature. The limiting distribu- 
tion of the NPMLE of a decreasing density on [0, oo) at a fixed point in the 
interior of the support, has been established by Prakasa Rao [27]. Similar 
results were obtained for the NPMLE of a monotone failure rate in [28] and 
for an estimator of a monotone regression function in [5]. Woodroofe and 
Sun [32] showed that the NPMLE of a decreasing density is inconsistent at 
zero. The behavior at the boundary has been further investigated in [2, 23]. 
Smooth estimation has been studied in [25], for monotone regression curves, 
and in [31] for monotone densities; see also [9] and [1]. The limit distribution 
of the NPMLE of a decreasing function in the Gaussian white noise model 
was obtained in [33]. Related likelihood ratio based techniques have been 
investigated in [3, 26]. 

Groeneboom [11] reproved the result in [27] by introducing a new ap- 
proach based on inverses. This approach has become a cornerstone in de- 
riving pointwise asymptotics of several shape constrained nonparametric 
estimators, for example, for the distribution function of interval censored 
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observations (see [15]) or for estimators of a monotone density and a mono- 
tone hazard under random censoring (see [17]); see also [18] for the hmiting 
distribution of the NPMLE of a monotone density under random censoring 
and [24] for similar results on isotonic estimators for a monotone baseline 
hazard in Cox proportional hazards model. The limit distribution of these es- 
timators involves an argmax process {C(c) '■ c G K} connected with two-sided 
Brownian motion with a parabolic drift. This process has been studied ex- 
tensively in [12], where it is also claimed that the approach based on inverses 
should be sufficiently general to deal with global measures of deviation, such 
as the Li-distance or the supremum distance between the estimator and the 
monotone function of interest. Indeed, the limiting distribution of the Li- 
distance between a decreasing density and its NPMLE was obtained in [14], 
and a similar result can be found in [6] in the monotone regression setup. 
These results were extended to general L^-distances in [22] and [7]. In [7], 
the limiting distribution of L^-distances is obtained in a very general frame- 
work that includes, among others, the monotone density case, monotone 
regression and monotone failure rate. 

Little to nothing is known about the behavior of the supremum distance. 
In [19], the rate of the supremum distance is established in a semi-parametric 
model for censored observations, and it is suggested that the same rate 
should hold in the monotone density case. In [16] an extremal limit theorem 
has been obtained for suprema of the process C(c) over increasing intervals. 
However, a long-standing open problem remains, although this problem has 
important statistical applications: what is the limiting distribution of the 
supremum distance between a monotone function and its isotonic estima- 
tor? Indeed, while pointwise confidence intervals for a decreasing density, 
a monotone regression function or a monotone hazard are available using 
the limiting distribution of the isotonic estimator at the fixed point, non- 
parametric confidence bands have remained a formidable challenge; they 
could be built if the limiting distribution of the supremum distance between 
a monotone function and its isotonic estimator were known. It is the purpose 
of this paper to settle this question in the same general framework as con- 
sidered in [7] . The precise construction of a nonparametric confidence band 
requires additional technicalities that are beyond the scope of the present 
paper. It is only briefly discussed here, and details are deferred to a separate 
paper. ^ 

We consider Grenander type estimators /„ for decreasing functions / with 
compact support, say [0, 1]. These are estimators that are defined as the left- 
hand slope of the least concave majorant of an estimator for the primitive 
of /. This setup includes Grenander's [10] estimator of a monotone den- 
sity, Brunk's [4] estimator for a monotone regression function, as well as the 
estimator for a monotone failure rate under random censoring, considered 
in [17]. We obtain the rate of convergence for the supremum of \fn — f\ over 
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subintervals of [0, 1]. The rate is shown to be of the order (n/logn)~^/^, even 
on subintervals that grow toward [0, 1] , as long as one stays away sufficiently 
far from the boundaries, so that the inconsistency at the boundaries (see, 
e.g., [32]) is not going to dominate the supremum. The rate that we obtain 
coincides with the one suggested in [19] for Grenander's [10] estimator for 
a decreasing density, but it is now proven rigorously in a more general set- 
ting under optimal conditions on the boundaries of the intervals over which 
sup \ fn — f\ is taken. Moreover, we show that the rate (n/ log n)~^/'^ is sharp. 
Our main result is Theorem 2.2, in which we show that a suitably standard- 
ized supremum of \fn — f \ converges in distribution to a standard Gumbel 
random variable. 

Our results are obtained following the same sort of approach as that used 
in [6, 7, 11, 14, 17], among others. We first establish corresponding results 
for the supremum of the inverses of fn and /, and then transfer them to 
the supremum of /„ and / themselves. A major difference with deriving 
asymptotics of L^-distances is, that in these cases one can benefit from the 
linearity of the integral and handle several approximations pointwise with 
Markov's inequality. This is no longer possible with suprema. With suprema, 
to transfer results for inverses to results for /„, a key ingredient is a precise 
uniform bound on the spacings between consecutive jump points of fn- 

The paper is organized as follows. In Section 2, we list the assumptions 
under which our results can be obtained and state our main results con- 
cerning the rate of convergence and the limiting distribution of sup \fn — f\- 
We also briefly discuss the construction of confidence bands. We formulate 
corresponding results for the supremum distance between the inverses of /„ 
and / in Section 3. This is the heart of the proof, which is carried out in 
Section 4. Finally, in Section 5, we provide a uniform bound on the spacings 
between consecutive jump points^of fn and then transfer the results obtained 
in Section 3 for the inverses of /„ and / to the supremum distance between 
the functions themselves. 

To limit the length of the paper, the rigorous proofs of several prelim- 
inary results needed for the proofs in Sections 4 and 5 have been put in 
a supplement [8]. 

2. Assumptions and main results. Based on n > 2 independent observa- 
tions, we aim at estimating a function / : [0, 1] — > M subject to the constraint 
that it is nonincreasing. Assume we have at hand a cadlag (right continuous 
with finite left-hand limits at every point) stepwise estimator F„ of 

F{t)= f f{u)du, tG[0,l], 
JO 

with finitely many jump points. In the case of i.i.d. observations with a com- 
mon density function /, a typical example is the empirical distribution func- 
tion with n discontinuity points located at the observations. In the following. 
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we shall consider the monotone estimator /„ of / as defined in [7], that is, the 
estimator /„ is the left-hand slope of the least concave majorant of -F„ with 

/„(0)=lim/„(t). 

As detailed in Section 2.1 below, this definition generalizes well-known mono- 
tone estimators, such as the Grenander estimator of a nonincreasing density, 
or the least-squares estimator of a monotone regression function. It should 
be noted that fn is nonincreasing, left-continuous and piecewise constant. 
We are interested in the limiting behavior of the supremum distance between 
the monotone estimator and the function /. 

2.1. Uniform rate of convergence. We first show that the rate of conver- 
gence of /„ to / in terms of the supremum distance is of order (log n/n)^/^. 
To this end, we make the following assumptions. Unless stated otherwise, 
for a function h defined on [0, 1], we write ||/i||oo = sup^g^ 1^(01- 

(Al) The function / is decreasing and differentiable on [0, 1] with 

inf \f'{t)\ > and sup \ f'{t)\ < oo. 
te[o,i] te[o,i] 

(A2) Let Bn be either a Brownian bridge or a Brownian motion. There 
exist g > 4, Cq> 0, L : [0, 1] ^ M and versions of Fn and Bn such that 

F{n^~^/'^\\Fn-F-n~'^/^BnoL\\^>x)<CgX~'' 

for all X € (0,n]. Moreover, L is increasing and differentiable on [0,1] with 
inft6[o,i] L'{t) > and sup^gp^i] L'{t) < oo. 

(A3) There exists Cq > such that for all x > and t = 0,l, 

e\ sup {Fniu)-Fiu)-Fnit) + F{t)f]<^. 

'-ug[0,l],x/2<|t-«|<a; ^ 

These conditions are similar to the ones used in [7]. Assumption (Al) is com- 
pletely the same as the one in [7]. Assumption (A2) is similar to (A4) in [7], 
but now we only require g > 4 and bounds on the first derivative of L. Here 
we can relax the condition on q, because in the current situation the error 
terms have to be of smaller order than (n/logn)^/^ instead of n^/^ in [7]. The 
existence of L" , as imposed in (A4) in [7], is not needed to establish Theo- 
rem 2.1. Finally, assumption (A3) is equal to (A2') in [7]. Assumption (A2) 
in [7] is no longer needed, since we are able to obtain sufficient bounds on 
particular tail probabilities with our current assumptions (Al)-(A2). See 
Lemma 6.4 and also the proof of Lemma 6.10 in [8]. 

A typical example that falls into the above framework is the problem 
of estimating a nonincreasing density / on [0,1]. Assume we observe i.i.d. 
random variables Xi , X2 , • ■ • , Xn with common nonincreasing density func- 
tion / : [0, 1] — )• M, and let Fn be the corresponding empirical distribution 
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function. In this case, the monotone estimator /„ of / coincides with the 
Grenander estimator. Assumption (Al) is equal to the ones in [7, 14, 22], 
and is standard when studying L^-distances between /„ and /. The exis- 
tence of a second derivative of / is not needed to obtain Theorem 2.1. In 
the monotone density model, assumption (A2) is satisfied for all q> 0, with 
L = F being the distribution function corresponding to / and Bn a Brown- 
ian bridge, due to the Hungarian embedding of [20]. From Theorem 6 in [7] 
it follows that assumption (A3) holds in the monotone density model. An- 
other example that falls into the above framework is the problem of esti- 
mating a monotone regression function. Assume for instance that we observe 
Hi = /(V"-) i = 1, 2, . . . , n, where the e^'s are i.i.d. centered random vari- 
ables with a finite variance o"^, and / : [0, 1] ^ R is nonincreasing. Let be 
the partial sum process given by 

1 " 

n ^-^ ~ 

i=l 

In this case, the monotone estimator /„ of / coincides with the Brunk esti- 
mator. Assumption (Al) is equal to the ones in [6, 7] and is standard when 
studying L^-distances in this model. Assumption (A2) is satisfied for all 
q>2 such that E|ej|''' < oo with L[t) = a'^t and Bn a Brownian motion, due 
to embedding of [29]. Thus, (A2) is satisfied in the above regression model 
provided E|ej|^ < oo. From Theorem 5 in [7] it follows that assumption (A3) 
holds in the above regression model. Other examples of statistical models 
that fall in the above framework, with corresponding q and L, are discussed 
in [7]. 

The uniform rate of convergence of fn to / for general Grenander-type 
estimators is given in the following theorem. 

Theorem 2.1. Assume (Al), (A2) and (A3). Let (a„)„, and {Pn)n be 
sequences of positive numbers such that 

(1) a,, > /Cm" (log n) -2/2 and (3n> K2n-^^^{logn)-'^^^ 
for some Ki,K2 > that do not depend on n. Then, 

sup \Ut)-f{t)\=oJ^^\'\ 

t6(a„,l-/3n] V / 

The rate in Theorem 2.1 coincides with the one found for the maximum 
likelihood estimator in a semi-parametric model for censored data by Jonker 
and van der Vaart [19], who suggest that this rate should also hold for 
Grenander's [10] estimator for a decreasing density. They consider a„ ^ 
7),"i/3(^lQg g^]^(^ constant, which is a slightly stronger assumption 
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than the one in Theorem 2.1. Note that condition (1) in Theorem 2.1 is 
sharp. If On = n""^, for some 1/3 < 7 < 1, then n^^"''')/^ (/„(«„) — /(a„)) 
converges in distribution, according to Theorem 3.1(i) in [23], so that 

{n/lognf/''^\fn{an) - /(a„,)| ^ 00. 
In fact, for sequences (q„)„, such that n^/'^(logn)^/^a„ — )• 0, it can be shown 
similarly that (nan)^^'^ {fn{o(n) — converges in distribution, which 

would yield (n/logn)^/^|/n(an) - /(««)! 00. 

2.2. Limiting distribution. Whereas the previous theorem only provides 
a bound on the rate of convergence, it is nevertheless crucial for deriving the 
actual asymptotics of the supremum norm of — / on suitable intervals. 
For this purpose, we need an additional Holder assumption on /' and L" . 

(A4) The function L in (A2) is twice differentiable and there exist Co > 
and a G (0, 1] such that for all t,u £ [0, 1], 

(2) \f'{u)-f'{t)\<Co\u-t\'' and \L" (u) - L" {t)\ < Co\u - tl'' . 

The condition on L" in assumption (A4) is a bit stronger than the one 
in [7]. This is needed to guarantee that the difference between the values 
of L" at t and its nearest point of jump of fn is negligible. The condition 
on /' in assumption (A4) is the same as (4) in [7], who already observed 
that the existence of /", as assumed in [14, 22], is no longer needed. Note 
that in the monotone density model L" = /', in which case (A4) reduces to 
a Holder condition on /' only. In the monotone regression model, L is linear 
so that (A4) again reduces to a Holder condition on /' only. 

In order to formulate the limit distribution, we need the following defini- 
tion: 

(3) C(c) = argmax{M/(t + c)-t^} for aU c € M, 

where Pt^ is a standard two-sided Brownian motion on M originating from 
zero, and argmax denotes the greatest location of the maximum. For fixed 
t G (0,1), properly scaled versions of n^^^{fn{i) — fit)) converge in distri- 
bution to the random variable C(0) (see, e.g., [27] or [11]). Moreover, 
serves as the limit process for properly scaled versions of n^^^{Un — g) (see, 
e.g., Theorem 3.2 in [14]), where [/„ and g are the inverse functions of fn 
and / respectively, as defined in Section 3 below. Properties of the process 
{C(c),cGM} can be found in [12]; for example, the process {C(c),cGM} 
is a stationary process. According to Corollary 3.4 in [12], the tails of the 
density of C(0) satisfy the following expansion: 

(4) /Li(t) ~2A|t|exp(-2|tp/3-K|t|) 
as |t| — > cxj, where k and A are positive constants. 
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We now present the main result of this paper. It states that the hmit 
distribution of the supremum distance between /„ and /, if properly nor- 
malized, is Gumbel. By Xn ^ Vn we mean Xn/Vn oo, as n — > oo. 

Theorem 2.2. Assume that (Al), (A2), (A3) and (A4) hold. Consider 
<u <v <1 fixed. Then, for any sequence of real numbers (a„)n CLi^-d {/3n)n 
both satisfying 

(5) a„^0, (3n-^0 and 1 - -y + /3n, it + On > ?^~"^^^(logn)~^/^, 
we have that for any x € M, 

as n — 7- oo , where 



K 1 

(6) /in = 1 - o1/.n_^,.N9/. + 



2^/3 (log logn 



^loglogn + log(AC/,L) 



with 



U(t) 



and A and k taken from (4). 



Note that from Theorem 2.2, with u = and v = 1, it follows that for 
convenient an and /3„,, 

n \fnit)-f{t)\ , 

lognj t.(«J-,„]|2/'(t)L'(i)|V3-l + "-^^^- 

Since both /' and L' are bounded from above and bounded away from zero, 
this proves that there are positive numbers Ci,C2 that depend only on /' 
and L' such that 

1/3 



Cl+Op(l)<(^) sup \fn{t)-f{t)\<C2 + 0p{l). 



n 

This means that the rate in Theorem 2.1 is sharp 



2.3. Confidence bands. Our main motivation for proving Theorem 2.2 is 
to build confidence bands for a monotone function /. Indeed, this theorem 
ensures that for any j; € M, with probability tending to exp(— e~^), we have 

\fn{t) - f{t)\ < (^-^Y'\2f'{t)L'{t)\'/'{f^n + ^ 

\ n J I log n 

simultaneously for alH G (n + a„, f — . Combining this with either plug- in 
estimators of /' and L' or bootstrap methods would provide a confidence 
band for /, at the price of additional technicalities. Indeed, the use of plug- 
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in estimators for the derivatives /' and L' may lead to inaccurate intervals 
for small sample sizes n, so that bootstrap methods should be preferable. 
But it is known that the standard bootstrap typically does not work for 
Grenander-type estimators; see [21, 30]. Thus, we shall use a smoothed boot- 
strap, which will raise the question of the choice of the smoothing parameter. 
In view of all this, we believe that the precise construction of a confidence 
band is beyond the scope of the present paper and is deferred to a separate 
paper. 

Note that the conditions of Theorem 2.2 do not cover the supremum 
distance over the whole interval [0, 1]. However, this is to be expected. For 
instance, consider the monotone density model. This model is one of the 
examples that is covered by our general setup (see Section 2.1) and it is 
well known that the Grenander estimator /„ in this model is inconsistent 
at and 1 (e.g., see [32]). Therefore, a distributional result can only be 
expected if the supremum is taken over subintervals of [0, 1] that do not 
include and 1. Let us notice, however, that we can obtain a confidence 
band for / on any sub-interval {u,v\ with fixed u,v € (0, 1) (by considering 
Oin = f^n = 0), and that the largest interval on which our result allows to 
build a confidence band is (a„,l — /3„], where a„ » n~^/^(logn)~^/^ and 
similarly, /3„ ^ n~^/^(logn)~'^/^. In order to obtain a confidence band on the 
whole interval [0, 1] , we would have to slightly modify the Grenander-type 
estimator /„ in order to make it consistent near the boundaries. For instance, 
we conjecture that, if we consider either the modified estimator in [23] or 
the penalized estimator in [32] instead of /„, then the limit distribution 
of the supremum distance between this modified estimator and / over the 
whole interval [0, 1] is the same as the limit distribution of the supremum 
distance between fn and / over the largest interval allowed in Theorem 2.2. 
Thus, such modified estimators would provide a confidence band for / over 
the whole interval [0,1]. As mentioned above, the precise construction of 
confidence bands is deferred to a separate paper, and we will do similarly 
with the precise study of modified estimators at the boundaries. 

3. The inverse process. To establish Theorems 2.1 and 2.2, we use the 
same approach as in [6, 7, 11, 14]. We first obtain analogous results (i.e., 
rate of convergence and limit distribution) for the supremum between the 

inverses of^^ and /, and then transfer them to the supremum between the 
functions fn and / themselves. Let be the upper version of Fn defined 
as follows: F+(0) = F„(0) and for every t G (0, 1], 

F+(t) =max|F„(t),limF„(n)|. 

Let Un denote the (generalized) inverse of /„, defined for a € M by Un{a) = 
sup{t G [0,1]: fn{t) > a}, with the convention that the supremum of an 
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U„.{a) 1 



Fig. 1. The function , its concave majorant (dashed) and a line with slope a (solid). 

empty set is zero. This is illustrated in Figure 1 below. From Figure 1, 
it can be seen that the value t = Un{o) maximizes F^{t) — at, so that 

(7) Un{a) = argmax{F^(t) — at}. 

iG[0,l] 

The advantage of characterizing the inverse process [/„ by (7), is that in 
this way, it is more tractable than the estimator /„ itself, as being the 
argmax of a relatively simple process. It is the purpose of this section to 
establish results analogous to Theorems 2.1 and 2.2 for the inverse pro- 
cess. 

Let g denote the (generalized) inverse function of /. In Theorems 3.1 
and 3.2, we give an upper bound for the rate of convergence of Un to g, and 
an extremal limit result for the supremum distance between Un and g. We 
derive the limit distribution of the supremum distance between Un and g in 
Corollary 3.1. 

Theorem 3.1. Assume that (Al) and (A2) hold. Then 
sup|C/„(a) - g{a)\ = O. 
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Theorem 3.2. Assume that (Al), (A2) and (A4) hold, and define for 
a € M the normalizing function 

(8) Aia)- l^'('<"»l^" 



(4L'(9(o)))i/3- 

Let < u < V < 1 fixed, and let (an)n o,n-d {/3n)n be sequences such that 
an^O, /3n — 7> and 0<n + a„<z; — /3„<1 for n sufficiently large. Define 

(9) S„ = ni/3 sup A{a)\Un{a) - g{a)\. 
Then 

(10) F{Sn < Un) ^ exp| -2r j!'}!)}^!!,^ dt 



(4L'(t))V3 

/or any sequence {un)n such that ii„ — ?> oo in such a way that n^/^fi{un) — )• 
r > 0, where fi denotes the density o/C(0), as defined in (3). 

The expansion in (4) allows us to provide a precise expansion of ii„ 
[see (34)] and to derive the following corollary from Theorem 3.2. According 
to this corollary, the limit distribution of Sn is Gumbel. 

Corollary 3.1. Assume that (Al), (A2) and (A4) hold. Let Sn he 
defined by (9), with Q <u <v <1, and an,Pn satisfying the conditions of 
Theorem 3.2. Then, for all x € M, 

rllognU-^) ' 5„-;U„l <xl ^exp{-e-^}, 



logn 

where \in is defined by (6). 

In order to transfer the results for Un to /„, we establish Lemma 5.2. 
This lemma does require conditions on sequences = u + a„ and tn = 
1 — V + (3n that are stronger than the ones in Theorem 2.2. However, once 
we have established the limit distribution for such sequences, we will show 
that Theorem 2.2 can be extended to more general sequences satisfying (5). 

4. Proofs of Theorems 3.1 and 3.2 and Corollary 3.1. We suppose in the 
sequel that assumptions (Al) and (A2) are fulfilled, and we denote by C, Ci, 
C2, . . . positive real numbers that depend only on g, Cq, f, L [and possibly 
on a under the additional assumption (A4)]. These real numbers may change 
from one line to the other. We write xy y = max(x, y) and x Ay = min(x, y), 
for any real numbers x and y. 

In order to deal simultaneously with the cases where Bn is a Brownian 
bridge or a Brownian motion [see assumption (A2)], we shall make use of 
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(11) Bnit) = Wnit)-Ut, te[0,l], 

where Wn is a standard Brownian motion, ^„ = if Bn is a Brownian motion 
and = Wn{l), a standard Gaussian variable that is independent of Bn, 
in case B^ is a Brownian bridge. To prove Tlieorem 3.1, we need some 
preHminary results on the tail probabilities of Un — g and its supremum. 
These results can be found in Supplement B in [8]. A first result, which is 
similar to Lemmas 2, 3 and 4 in [7], is that there exist Ci > and C2 > 
such that for all a G ffi and 2; > 0, 

(12) F{n'/^\Unia) - gia)\ > x) < ^^^^ J^' + 2exp(-C2X^). 

In particular, for all a € M, this implies that Un{a) — g{a) = Op{n~^/^). See 
Lemma 6.4 in [8]. This is not sufficient to obtain Theorem 3.1, but it will 
be used for its proof. 

Proof of Theorem 3.1. RecaU that g{a) = 1 for all a < /(I), 51(a) = 
for a > /(O) and Un is nonincreasing and takes values in [0,1]. Hence, we 
can write 

(13) sup \Un{a) - g{a)\ = |C/„(/(l)) - 5(/(l))| 
and 

(14) sup \Un{a)-g{a)\ = |C/„(/(0)) - ^(/(O))!. 

a>/(0) 

This means that 

sup\Un{a) - g{a)\ = sup \Un{a) - g{a)\. 

am aG[/(l),/(0)] 

Therefore, to prove Theorem 3.1 it suffices to show that 

/I \ 1/3 

^ / log Tl 

sup \Unia) - g{a)\=Op( 

aG[/{i)J{0)] V n 

According to Lemma 6.5 in [8], the bound in (12) can be extended such that 
for any x > 0, 

^ ( log n ^ ^1'^^ 

sup \\Jn{a) - g{a)\> x\ 

ae[/(i),/(o)] V ^ 

where C3 = C3{/(1) — /(O)}. The latter upper bound tends to zero as n ^ 00 
for all X > (3C2)~^/^ since g > 4 by assumption. This completes the proof of 
Theorem 3.1. □ 
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We suppose in the sequel that in addition to (Al) and (A2), assump- 
tion (A4) is fulfilled. The first step in proving Theorem 3.2 is to approximate 
an adequately normalized version of Un{a) by the location of the maximum 
of a Brownian motion with parabolic drift. To this end define 

(15) Vn{a) = ni/3(L(f/„(a«)) - L{g{a))), 
where 

(16) a^ = a- n~^/'^inL'{g{a)) for aU a € M, 

with taken from representation (11). Then for < n < f < 1 and an,(3n 
satisfying the conditions of Theorem 3.2, we obtain 

5„VOp(l)= sup _li^|y„(a)|vOp(l) 

aG[f{v-l3„)J{u+a„)] ^ {9(a)) 

+ Op(n-'^/2(logn)2/3) + Op(n-V6)^ 

where Sn is defined by (9), and a € (0, 1] is taken from (A4). See Lemma 6.6 
in [8]. 

Next, we proceed with localization. The purpose of this is that localized 
versions of V^(a) and Vn{b), can be approximated by independent random 
variables, if a and b are in disjoint intervals that are suitably separated. 
First note that the location of the maximum of a process is invariant under 
addition of constants or multiplication by n^/^. Therefore, from (7) it follows 
that for all a € M we have 

(17) Vn{a) = argmax{Wg^a){t) + Dn{a,t) + Rn{a,t)}, 
where 

(18) /„(a) = [ni/3(L(0) - L{g{a)))y/\L{1) - ma)))] 

for every s G [0,1] fixed, Wg is the standard Brownian motion defined by 

(19) Wsit) = n^/^{Wn{L{s)+n-^/H)-WniL{s))} for t G M, 
with Wn defined by (11), and 

Dn{a,t) = n'^^^{F o - aL^^){L{g{a)) + n^^^h) 
-n"\F{g{a))-ag{a)), 

(20) 

Rn{a, t) = n^'^ia - a^){L-\L{g{a)) + n~^/h) - g{a)) 
-n-^/^Cnt + Rnia,t), 
where is taken from representation (11), and for all a and t, 
(21) \Rn{a,t)\ < -F- n-^I^B^ o L\\^. 
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For all a G M, we define the localized version of Vn{a) by 

(22) Vnia)= argmax {Wg^a)it) + Dnia,t) + Rn{a,t)}. 

t£ln{a):\t\<\ogn 

We find that 

a6[/(t;-/3„),/(n+a„)] ^ \9W) 

T##^lK,(a)|+o,(logn)-2/3 

for any h{a) G M that satisfies \a — h{a)\ < n~"'^/'^(logn)^. See Lemma 6.7 
in [8]. 

Finally, using the fact that, roughly speaking, 

2{L'{g{aW' ^ 2{L'{g{bW' 

for all b close enough to a, we bound |V„(a)| from above and below by the 
absolute value of the following quantities: 

(23) V+{a,b)= argmax j^y ^(t) - ( JZM^ _ 2e„V2| 
and 

(24) V-{a,b)= argmax (,)(i) - ( J/M|ll + ae^V^l 

where /n(a) and Wg(^a) are defined in (18) and (19), b is chosen sufficiently 
close to a, and where (e„)„ is a sequence of positive numbers that converges 
to zero as n ^ oo, which is to be chosen suitably. The purpose of this is 
that when we will vary a over a small interval and fix b to be the midpoint 
of this interval, we will obtain variables Vj^{a,b) that are defined with the 
same drift, 



and the Browian motion Wg(^a) only depending on a. The case of V~{a, b) is 
similar. 

For < u < V < 1, and an,f3n satisfying the conditions of Theorem 3.2, 
we obtain 

Sn < sup _l(M^|y+(a,6(a))| vOp(l) + Op(logn)-2/3^ 
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y^^^^^|V7(a,6(a))|+o,(logn)-2/3 
ae[f{v~i3n)j{u+a„)] ^ {9[b[a))) 



for any 6(a) € M tliat satisfies |a — 6(a)| < n^^/^(logn)^, where Sn is defined 
by (9) and e„ = 1/logn in (24) and (23). See Lemma 6.8 in [8]. 

Note that in order to obtain the above approximations, we use the fohow- 
ing lemma, which is a variation on Lemma 2.1 in [23]. Although very simple, 
it turns out to be a very useful tool to compare locations of maxima. 

Lemma 4.1. Let I cM. be an interval. Let g and Z be real valued func- 
tions defined on L such that there exists 7 > with 

g{u) < g{v) for all u, v such that \u\ > \v\+ j. 

Assume that both sup^gjZ(ii) and sup^^j{Z{u) + g{u)} are achieved. De- 
noting by argmax an arbitrary point where the maximum is achieved, we 
have 



argmax{Z(u) + g{u)} 



< 



argmax{Z(ii)} 

U&I 



+ 7- 



Proof. Suppose the maximum of Z is achieved at v £ L, so that Z{u) < 
Z{v) for all n G /. It is assumed that for all u € / such that \u\ > \v\ +7, we 
have g{u) < g{v). Therefore, 

Z{u)+g{u)<Z{v)+g{v) 

for all n G / such that \u\ > \v\ +7. It follows that the maximum oi Z + g 
cannot be achieved at such a point u, which means that 



argmax{Z(ii) + g{u)} 



< \v\ + 7. 

This completes the proof by definition of v. □ 



To relate the suprema of and V~ with maxima of independent random 
variables, we will partition the interval [f{v — /3n), f{u + a„,)] into a union 
of disjoint intervals Ai and Bi of alternating length, and a remainder inter- 
val Rn, in such a way that the length of the small blocks Ai is 

2||/'||c 



(25) 



-n 



-1/3 log 



n. 



infte[o,i] -^'(0 

and the length of the big blocks Bi is L„ = 2n^^/'^(logn)^. More precisely, 
for i = 1, 2, . . . , Kn, where 



(26) 



Kn 



f{u + an) - f{v- (3n) 
In "I" -^n 
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let 



(27) 



A, = [f{v - f3n) + l)(/„ + L„), fiv - (3n) + iln + l)Lnl 
= [f{v - /3n) + iln + l)Ln,f{v - /?„) + i(/„ + L,)], 



and let Rn = [f{v - /3„,) + -fC„(/„ + L„), /(u + a„)], so that In + Ln< \Rn\ < 
2{ln + Ln) and 



(28) 



[f{v-(3n),f{u + an)] = ( U^O U ( 



Now, suppose that <u < v <1, and an>/3n satisfy the conditions of Theo- 
rem 3.2 and let (Cj)'tgN be a sequence of independent processes, all distributed 
like C given in (3). Then, using scaling properties of the Brownian motion, 

we can build (possibly dependent) copies (C]^^)i (C]^^) of (Cj)«eNi such that 



(29) Sb < 



^"""^"^^^ < 5« V ^(f) V Op(logn)V3 + Op(logn)-2/3, 



1 + 0(1/ log n) 



where 



Sb= max sup |Cj(c)| and 5^^"* = max sup |Ci'^^(c)|, 

l<i<K„ ce[0,Ai„] l<i<i^n ce[0,Ain] 



•^A ^ = „ I Ci (c) I , 



2<i<itncG[0,5.„j 

with Kn defined in (26) and where uniformly in i, 



Ai„ = (l + o(l))(logn) 



L'igih))f'{g{h)) 



-1/3 



and <6in < Clogn, for some C > 0, where 6j denotes the midpoint of the 
interval Bi defined in (27). See Lemma 6.9 in [8] for a rigorous proof of (29). 

(2) 

The fact that i > 2 in the definition of is due to the fact that the first 
small block Ai has to be treated separately. 

At this stage, we need a precise control of the tail probabilities of the 
supremum of the limiting process C over increasing intervals. Specifically, in 
Supplement A of [8], we obtain the following slight variation on Theorem 1.1 
in [16]. Suppose (5„ — > oo, r„ ^ and Un — > oo, in such way that Un/dn 0, 
SnfJ-iun)/rn ^ 1, and log(T„)/(5^^ 0. Then 



(30) 



logP(supcg[o,5j |C(c)| <Ur 



-2r„ 



1 



0. 



See Lemma 6.3 in [8] for a rigorous proof. 

We are then in the position to establish Theorem 3.2 and Corollary 3.1. 
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Proof of Theorem 3.2. Let (u„)„ be a sequence such that tf„ — > oo 
in such a way that 

(31) ^ T > 0, 

where fi is the density of C(0). We wiU bound ¥(Sn < Un), where Sn is defined 
by (9), from above and below by means of (29). Write 



'S'i= max sup |cf^(c)|, 
i<j<i^„ce[o,A, " 



82= max sup |Cf^''(c)|. 

Then, according to (29) 

^{Sn < Un) > P((l + 0(l/l0gn)){5i V ^2 V Q„} + i?„ < Un), 

where Qn = Op(logn)^/^ and Rn = Op(logn)~^/^. Define the event En = 
{(logn)2/3|i?„| < 1}, then F{E^) 0, so that 

nSn < Un) > W{Si V ^2 V Q„ < Vn) + o(l) 

= < Vn, S2 < Un,Qn < Vn) + o{l), 

where 

u„- (logn)-2/3 , 

= 1 , ^rn r~iT ~ ~ (log") asn^oo. 

1 +0((logn)-^) 

From (4) and (31), it is easily verified that Un is of order (logn)^/'^ [see also 
the expansion (34) below] and that 

(32) n^^^Kvn) ^ r. 

Therefore, since P((5n < Vn) — > 1, we have 

< Un) > P(5i <Vn,S2< Vn) + o(l). 

We will investigate P(S'i < Vn) and P(S'2 < Vn) separately. 
Since the processes (^^^ are independent copies of C, 

nSl<Vn)=X[^[ sup \C{c)\<Vn 
j=l ^cG[0,Ai„] 

For each i = 1, 2, . . . , Kn fixed, we apply (30), with 



A,„ = (l + o(l))(logn)^ 



L\g{h))r{g{h)) 



-1/3 



which is of the order (logfi)^ uniformly in z, and Tin — 

rA inTi I , where 

the hi are the midpoints of the Kn big blocks Bi. The hi are equidistant at 
distance In + = 2n~^/'^(logn)^(l + 0(logn)~^). Since Tin uniformly 
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in i and Vn is of order (logn)^/'^, we conclude that 

TTpf sup \C{c)\<Vn) 
ZTl ^cG[0,A„,] ^ 

= JJexp(-2Ti„(l + o(l))), 
1=1 

where the small o-term is uniform in i. Therefore, 
IIf( sup |C(c)|<^„ 

/(«) 1 I 

It follows that 

P(S,<„„)^exp{-2.£||£idt 

The probability ^{82 < Vn) can be treated in the same way: 

K„ 

nS2<Vn) = llF( sup \C{c)\<Vn^ 



exp< — 2r 



exp<^ -2(l + o(l))r^<5i„n-i/3 1^1, 



since, according to (29) and (26), 

J]](^mn^/^ < Cn"i/3^n log n = O(logn)"^ 

This yields that 



liminf P(5n < Un) > exp|-2r / 
Similarly, with (29), 



(4L'(t))V3 



dty 



HSn<Un)<F{ max sup \Ciic)\ <Vn) + o{l), 
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where Vn satisfies (32). This probability can be treated completely similar 
to P(S'i < Vn)-, so that 



limsupP(5„ < Un) < exp< — 2t / 

n^oo [ Ju 

This proves the theorem. □ 



(4L'(f))i/3 



Proof of Corollary 3.1. Let {un)n be a sequence such that ii„ — t- oo 
in such a way that n^^^^{un) — )■ r > 0, as n — )■ oo. Taking logarithms in (4), 
we conclude that should satisfy 

1 2 r 

(33) -logn + logu„ - -u^ - Ku„ = log — + o(l) asn-^oo. 

This means that — 2n^/3 is the dominating term, which should compen- 
sate (logn)/3. Therefore, if we write Un = 2~^/'^(log?i)^/^ + 5„, where (5„ = 
o(logn)^/^, and insert this in (33), we obtain 



1 



log n + log 



2 f log n 



logn 



1/3 



+ 3 



logn\ 



2/3 



logny/^^2 



logny/^ 



log ^ + 0(1) 



Tedious, but straightforward computations first yield that (5„ ^ and 
then that 

<5„=-^(logn)-V3+4 



41/3 
(logn) -2/3 



6 



-(logn) 2/'^ log log 



n 



logr log(2A) , 41/3 ■ 
H log 2 



21/3 2V3 



6 



+ o(logn) 



2/3 



If we put t4 i/^Cf/^ = e ^, or — logr = x + log Cfj^ — (2 log 2)/3, this implies 



that 



(34) 



-1/3 



+ 



4I/3 



(log n) log log 



n 



/, N-2/3/^^ + log C'al logA\ . 
(log n) / ^^^^ + ^ J + n) 



-2/3 



If we also write Un = x/an + hn + o(logn) ^Z'^, with 
a„ = 2V3(logn)2/3, 



(logn 



,1/3 



^ 4^/^ log logn ^ log(AC/,L) 



21/3 (41ogn)i/3 6(log7i)2/3 21/3 (log n)2/3' 
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then (10) is equivalent to F{an{Sn — bn) + o(l) < x} — > exp{— e ^}. Finally, 
it is easy to see that 



This proves Corollary 3.1. □ 

5. Proof of Theorems 2.1 and 2.2. We suppose in the sequel that as- 
sumptions (Al), (A2) and (A3) are fulfilled. As before, C, Ci, C2, . . . denote 
positive real numbers that depend only on q, Cg, /, L, Cq, and possibly also 
on a under the additional assumption (A4). It follows from the definition 
of /„ that it can be discontinuous only at the jump points of Fn- In partic- 
ular, the number of jump points of fn is finite. In the sequel, we will denote 
this number by A^^^ — 1 (note that Nn > 1). Moreover, we set tq = 0, Tjy^ = 1, 
and in the case where /„ has at least one jump point, Jhat is, A„ > 2, we 
denote by ri < • • • < tn^-i the ordered jump points of /„. 

To prove Theorems 2.1 and 2.2, we need a precise uniform bound on 
the spacings between consecutive jump points of /„. This is given by the 
following lemma. 

Lemma 5.1. Assume (Al) and (A2). Then 



Proof. It follows from the definition of and C/„ that these functions 
are nonincreasing left-continuous step functions with finitely many jump 
points, and that the maximal length of the flat parts of /„ is precisely the 
maximal height of the jumps of C/„. Therefore, 





(35) 




max |Tj — Ti-i \ = sup limC/„(6) — Un{a) . 

l<i<Nn am bi<^ 

Using the triangle inequality, it follows that 



But g is continuous on M, so that Theorem 3.1 implies that 





which completes the proof. □ 
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Remark 5.1. Lemma 5.1 together with the identity 1 = ^i=i{Ti — Ti-i) , 
implies that 1/7V„ = Op(n-^/2(log?i)^/^). This gives some idea about the 
order of magnitude of the number of jumps of /„. Further investigation 
is needed to obtain a sharp upper bound, and we conjecture that it is of 
order n^/^. This rate is also claimed in Theorem 3.1 in [13]. 

We will also need a bound on the mean absolute error between fn and /. 
In Supplement C in [8], we reprove Theorem 1 in [7] under slightly weaker 
assumptions; that is, there exists C > such that 

(36) E|/„(t)-/(t)|<Cn-V3 

for all t G [n~^/3, 1 — n~^/^] and 

(37) nl{t)-fit)\<C[n{tA{l-t))]-'/' 
for all t£ (0,n-^/3] U See Lemma 6.10 in [8]. 

Note that^he number of jump points of C/„ is precisely the number of 
flat parts of fn, that is Nn, and denoting by 71 > • • • > 7Ar„ the jump points 
of Un, we have 

(38) 7j = fnin) and n = Un{li) for i = 1, 2, . . . , iV„. 

We will show that in order to study the supremum of |/„ — / | over an interval, 
we can restrict ourselves to the situation wherej^he boundaries of the interval 
are jump points of fn and where the values of fn stay in (/(I), /(O)). Indeed, 
in order to relate the supremum of \fn — f \ to the supremum of \Un — g\, 
we need to employ the identity 7j = f{g{'^i))-, for 7^ = fn{Ti), so we need to 
make sure that fn{Ti) G (/(l),/(0)). To this end, define for any t € (0, 1) 

(39) ii (t) = min{i € {1, 2, . . . , iV„} such that n > t} 
and 

(40) i2(t) = max{2 G {0, 1, . . . , iV„ - 1} such that Tj < 1 - t}. 

For any t such that n^^^t 00 and n^^^{l — t)^ 00, we establish the order 
of the difference with neighboring points of jump of fn, that is, 

(41) ri = t + Op{n-^/'^) 

for i = ii{t) — l,ii{t),ii{t) + 1, and similarly for 1 — t, 

(42) r^ = l-t + Op{n~^/'^) 

for i = i2{t) — I,i2{t),i2{t) + 1. See Lemma 6.11 in [8]. Note that if there are 
no jumps on the interval [s, 1 — t), then Tj^(s) > ^12(1)- This may happen if 
the length 1 — t — s oi the interval tends to zero too fast. However, if 

(43) n^/^s 00, n^/^t 00 
and 

(44) n^/3(l-t-s)^oo. 
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then 

(45) ns<n^{s)<Ti,it)<l-t)^l. 
See Lemma 6.12 in [8]. According to Lemma 6.13 in [8], 
lP(7i < /(O) for all i > ii{s)) 1, 

(46) 

P(7i>/(1) for alli<i2{t))^l, 

whenever (44) holds, which ensures that fn{Ti) G (/(I)) /(O)) simultaneously 
for various I's, with probability tending to one. 
We are then in the position to prove Theorem 2.1. 

Proof of Theorem 2.1. First, we establish the result for sequences 
On = Sn and fin = tn that satisfy (43) and (44). For the sake of brevity, write 
ii = ii{sn) and 12 = i2{tn)- Define the event 

En = {Sn < Til < < 1 - tn} 

(47) 

n{7ie(/(l),/(0)) for alU = h,...,i2}- 

Then according to (45) and (46), we have P(-E'n) — ^ Ij so we can restrict 
ourselves to the event En- We have 

sup \fn{u)-f{u)\ 

Me(Sn,l-tn] 

< max sup \fn{u) - f{u)\+ sup |/„(n) - /(ti)|. 
1=11,.. 12 ue{Ti^un] ue{n^,i-t„] 

Recall that fn is constant on every interval (rj_i , Tj] , for i = 1 , 2, . . . , Nn — 1 ■ 
Moreover, /' is bounded. Using the triangle inequality, it follows that 

sup \fn{u) - f{u)\= sup Ifnin) - f{u)\ 

"S{ri_l,Ti] «e{Ti_l,Ti] 

<l/n(r.)-/(r.)| + ||/'||^|r,_i-Ti| 

for all i = 1, 2, . . . , A'^^ — 1 and 

sup \fniu) - f{u)\ < \fnil - tn) " /(I - tn)\ + \\f'\\oo\^i2 " Ti2+l\- 

UG{Ti^,l-t„] 

From (43) and (44), we have 1 — tn € [n~^/^, 1 — n~^/^], for large enough n, 
so (36) ensures that Aj(1 - 1„) - /(I - 1„) = Op(n-i/3). Using (35) and (38), 
it follows that 

sup \fn{u) - f{u)\< max Ifnin) - f {Ti)\ + Op 
we{sn,l-M ^=^l,■■■,^2 

= max |7i - / o C/„(7j) I + Op 

l=ll,...,l2 
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On the event En, we have 7, = / o (7(7^), for all i = ii, . . . ,12, and therefore 

sup |/„(^z)-/(n)|<||/'||^. max. \g(j,) - Uni7^)\ + oJ^-^) 

U&{Sn,l-tn] 1=11,.. ..12 \ ^ ) 

< II/'IL sup|C/„(a) - g{a) \ + O, (^-^) "\ 

Theorem 2.1, with a„ = Sn and /3„ = tn satisfying (43) and (44) now follows 
from Theorem 3.1. 

It remains to extend the result to more general sequences a„ and For 
this purpose, define Sn = n~^/^(logn)^/^. In view of the foregoing results, we 
know that 

(48) sup \Ut)-f{t)\ = oJ^^\'\ 

tG(s„,l-s„] \ ^ J 

Suppose a„ and /?„ satisfy (1). Let us notice that sup^g^^^ \fri{t) ~ 
f{t)\ decreases when either a„ or /3„ increases, so that we can restrict our 
attention to small values of a„ and /3„. Without loss of generality we may 
assume that a„ < n~^/^ < s„ and /3„ < 

We then use the following property of nonincreasing functions hi and /i2 
on an interval [a, 6]: 

sup \hi{t) - h2{t)\ 

(49) 

< \hi{a) - h2{a)\ V \hi{h) - h2{h)\ + \h2{a) - h2{h)\. 

See Lemma 6.1 in [8]. Since and / are both nonincreasing, according 
to (49), we have 

sup \7n{t)-f{t)\ 

< |/n(«n) - /(an)| V |/(s„) - /„(s„)| + ||/'|L(s„ - a„). 

Because s„ € [n~^^^, 1 — n~^/'^], it follows from (36) and (37) that /(s„) — 
fnisn) = Op{n^^/^) and /„(«„) - /(««) = Op((f^an)~"^^^), which is of the or- 
der Op(n~^/^(log?i)^/^), as we have assumed that > Kin~^^^ {logn)~'^^^ . 
We conclude 

sup \fn{t)-f{t)\=Op(^-^Y\ 

Similarly, we obtain 

'1 \ 1/3 
log n ^ ' 



sup \fn{t)-f{t)\=Op 

te(i-s„,i-/3„] 



n 
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and therefore, 

sup |7„(i)-/(t)|= sup \fn{t)-f{t)\VO, 

te{a„,l~f3n] t€{Sri,l-S„] 

Theorem 2.1 now follows from (48). □ 

To prove Theorem 2.2, similarly to the proof of Theorem 2.1, we first 
establish the result for sequences s„ = u + a„ and t„ = f — satisfying (43) 
and (44), and then extend the result to more general sequences. The first 
step is to prove that the behavior of supremum over the interval {sn, 1 — tn] 
is dominated by that of the largest interval between two jump points of /„, 
contained in (s„,, 1 — 1„,]. For this task, we make use of the notation Tj, ji, ii 
and i2 as introduced in (38), (39) and (40), and for t € [0, 1], we define the 
normalizing function 

(50) B{t) = {4\f'{t)\L'{t)r'/\ 

It is easy to see that under assumptions (Al), (A2) and (A4), there exists 
Co > and a S (0, 1] such that 

(51) \A{u) - A{v)\<Co\u-v\'' and \B{u) - B{v)\ < Co\u - v]" 

for all u,v ^ [0,1], where A is given by (8). Recall that by convention, the 
supremum of an empty set is equal to zero. 

For s,t that satisfy conditions (44) and (43), we first obtain 

sup B{u)\Jn{u)- f{u)\ 

u£(s,l-t] 

(52) 

sup B{u)\fn{u)- f{u)\yOp{n-^'^). 

See Lemma 6.14 in Supplement C in [8]. We are then able to make the 
connection between C/„ and /„. 

Lemma 5.2. Assume (Al), (A2), (A3) and (A4). Let Q < s <l - t <l, 
possibly depending on n, such that s,t satisfy conditions (43) and (44)- Then 

sup B{u)\fn{u) - f{u)\ 

u€{s,l-t] 

sup A{a)\Un{a) - g{a)\ + O. 
«e[/(i-t)J(«)] 

for some a € (0, 1] . 

Proof. Again write ii = ii{s) and i2 = ^2(i)- We first decompose the 
supremum into maxima of suprema taken over intervals between succeeding 





logn^ ^^'^ 
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jump points of /„: 

sup B{u)\Jn{u) - f{u)\= max sup B{u)\Jn{u) - f{u)\. 

Then, by Theorem 2.1 and (35), we have that 

sup \7n{u) - f {u)\ < sup \Jn{u) - f{u)\ = Op 

Thus, we obtain by means of (51) and the triangle inequahty that 
sup B{u)\fn{u) - f{u)\ 

/logn^('^+^)/3 
= max B{Ti) sup \fn{u) - f{u)\ + Op[ 

n + l<t<t2 «6(r,_i,r,] V 

By monotonicity of /, we have for all ii + 1 < i < ^2) 

sup \Uri) - f{u)\ = |/„(r,) - /(r,)| V |/„(r,) - /(r,_i)|. 

uein^i, Ti] 

Hence, with (38) we arrive at 
sup B{u)\fn{u) - f{u)\ 

max^. i?(r,){|7-/(ri)|V|7.-/(r._i)|}+Op( ^ 



On the event En of (47), we can write 7^ = f{g{'^i)) for all i = ii + 1, . . . , ^2, 
which, in view of (38), implies that 

\l^-f{r,)\ = \g{^,)-Un{l^)\■\nea)l 

\l^ - /(ri_i)j = \g{ii) - C/„(7.~i)l • 1/(^^2)1 

for some On between 5(71) and Uniji), and 6i2 between 5(74) and Uni'Ji-i)- 
By (46), Theorem 3.1 and (2), it follows that 

(53) |7. - f{T^)\ = \g{7^) - UM\ ■ \f{g{7^))\ + Op[-j^j 

By (38), (35) and Theorem 3.1, we have that 

max \g{ji) - Un{'ji-i)\ = max \g{ji) - Uni'Ji) + n - Ti_i\ 

ii+l<i<i2 iiH-l<j<i2 

/] \ 1/3 

(54) <snp\g{a)-Unia)\ + 0J^] 

am \ n J 

logn^ ^^'^ 
n 
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SO that similarly as above, 



|7i - /(r,_i)| = \g{-fi) - Un{l^-l)\ ■ \f'{9i7^))\ +Op(^ 



1 \ {o"+l)/3 

iogn ^ ^ " 
n 



It follows that 

sup B{u)\fn{u) - f{u) 



:. max B{Ti)\f'{g{^m\9{l^)-Un{l^)\y\g{l^)-Un{l^^l)\} 

iogn ^ ^ " 



+ Op 

n 

In order to replace B{Ti) by B{g{'^i)), we first note that (51), (53) and 
Theorem 3.1 imply that uniformly in f, 

|i?(r,)-i?(5(7^))l<Co|T,-5(7.)r 

<coib'ci/(T.)-7.r=Op(^)''^'. 

By definition of A and B, we have A{a) = B{g{a))\f' {g[a))\, for all a G M, 
so from Theorem 2.1 and (54), we conclude that 

sup B{u)\fn{u) - f{u)\ 
= max A{ji)\g{ji) -Un{7i)\y max A{ji+i)\g{ji+i) - Un{7i)\ 

il-|-l<j<i2 ii<i<i2 — l 

\ (o-+l)/3 



n 

By the triangle inequality, on the event £"„ of (47) we can write 

h+i - 7.1 < Il/'lloo{l9(7.+i) - UnM\ + |<7(7.) - C>n(7.)l} 
for all ii < i < ^2 — 1, so that Theorem 3.1 together with (54) implies that 

/logn^ 1/3 

(55) max |7i+i - 7^] = Op ( 

«i<r<i2-l \ n 

Together with (51) and (54), this allows us to replace A{'yi^i) by ^(7^), so 
that 

sup B{u)\fn{u) - f{u)\ 

«e{Tij^ .Tig] 

= . max ^(7i)|c/(7i)-{7„,(7i)|V max A{-fi)\g{ji+i) - Un{-fi)\ 

11-|-1<2<J2 li<l<J2 — 1 



+ 0p 



1 \ (o"+l)/3 

iogn ^ ^ " 



n 
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Now, recall that Un is constant on intervals (7j_(_i,7j], and g is monotone. 
This implies that 

sup \Un{a) - g{a)\ = \Un{li) - g{li)\ V \Un{li) - g{li+i)V 

ae(7i+i,7i] 

Therefore, taken into account joint indices, we find that 
sup B{u)\Jn{u) - f{u)\ 

= max A{'yi) sup \Un{a) - g{a)\ 

,,+l<j<,2_l aG(7i+l,7«] 



V ^(7*2) 15(7*2) - Un{l^,)\ V ^(7,J|g(7,,+i) - C/„(7*J| 

+ 0. 



1 \ ('^+l)/3 
logn ^ ^ " 



n 



Next, consider the term A(7jJ| (7(74^+1) — C/n(7ii)|, and let e > 0. According 
to (43) and (41), there exists C > such that P(/n) > 1 — e, for n sufficiently 
large, where In = {ti-^ — s < Cn~^/^}. By monotonicity, we have on this event 
that 7i^ = fnijh) is between fn{s + Cn~^/^) and fn{s), which are both equal 
to f{s) + Op{n~^^^) by (36). A similar argument holds for 7^1+1, so that 

(56) 

7,,+i = /(s) + Op(n-V3). 
Together with (38) and (46), this implies 

lailh+i) - Un{7n)\ = lailh+i) - 9{f{rh))\ 

< Ilff'llool7n+i -/(^.i)l 

= y\\Jm-f{r^,)\+o,in-'/') 

< llff'llooll/'llool^ - I + 0,{n-'/') = 0,(n-V3). 
Similarly, it follows that 

(57) \9{li2)-Unil^,)\ = OJ,{n-'/^'), 

since by the same arguments as above, 7^2 = /n(7"i2) is between /„(1 — i) and 
fn{l — t — Cn~^/^) with probability greater than 1 — e, and both terms are 
equal to /(I -t) + Op(n~i/3)^ 

Since A is bounded, we conclude that 
sup B{u)\fn{u) - f{u)\ 

= max A{-fi) sup \Un{a) - g{a)\y Op{rr^^'^) 

n+l<i<i2-l ae(7>+i,7z] 



+ 0p 



1 \ (o"+l)/3 

logn ^ ^ " 



n 



THE Loo -ERROR OF GRENANDER-TYPE ESTIMATORS 27 

To replace ^(7i) by A{a) for a € (7i+i, 7^], we use (55), (51) and Theorem 3.1. 
Together with (52), we conclude that 



sup B{u)\ fniu) - f{u) 

u£{s,l-t] 

(58) 



sup A{a)\Un{a)-g{a)\yOp{n-^/^) + Op 

ae(7i2''>'ii+l] 



1 \ {o"+l)/3 

log n ^ ^ " 



n 



It remains to extend the latter supremum to the interval [/(I — t),f{s)]. We 
have 

sup A{a)\Unia) - g{a)\ <\\A\\oo sup \Un{a) - g{a)\. 

a6[/(l-t),7«2] a6[/(l-t),7«2] 

According to (49), 

sup \Un{a) - g{a)\ < |^„(/(1 - t)) - g{f{l - t))\ 

aG[/(l-i),7i2] 

y\Un{l^,)-g{li,)\ + \\g'\Ul^,-f{l-t)\. 

Similarly to (56), we can write = f{l — t) + Op{n~^^^). Together with (12) 
and (57) we obtain 

sup |[/„(a)-5(a)|=Op(n-V3) 

ae[/(l-i),7»2] 



and likewise, 



sup \Un{a)-g{a)\ = Op{n~^l''). 

aG[7ii + iJ{«)] 



1 \ (o"+l)/3 

logn \ ^ ' " 



From (58), we conclude that 

sup B{u)\Jn{u) - f{u)\ 

ne(s,l-t] 

(59) 

= sup A{a)\Un{a)- g{a)\y Rn + Op 
aelf{t)j{s)] V n 

where i?„ = Op{n~^^^). We have 

sup A{a)\Un{a)-g{a)\V Rn^ sup A{a)\Un{a) - g{a)\) 

ae[f{t)J{s)] ae[f{t)J{s)] ' 

<p(i?„> sup A{a)\dn{a)-g{a)\ 

^ aG[/{t),/{s)] 

But it follows from Corollary 3.1 that 
/] \ -1/3 

(60) {^\ sup ^(a)|f/„(a)-g(a)|= 2-1/3 + 0^(1). 

V ™ / ae[/(i),/(s)l 
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Since Rn = Op{{n/ logn)^^/'^) , it follows that the latter probability tends to 
zero as n — )• oo. The lemma now follows from (59). □ 

Proof of Theorem 2.2. Let 5„ be defined by (9), with 0<u<v<l 
fixed and a„ and /3„ satisfying (5). Let 

(61) Sn = u + an and tn = l- v + /3n- 

Then automatically s„ and t„ will always satisfy condition (44). If, in addi- 
tion, Sn and tn satisfy condition (43), then according to Lemma 5.2 together 
with (60), 

sup B{t)\fn{t)-f{t)\ 

te{u+a„,v-(Sn] 

has the same limit distribution as 

sup A{a)\Un{a) - gia)\, 

ae[/(t;-/3„),/(«+a„)] 

SO that Theorem 2.2 follows from Corollary 3.1. When < u < v < 1, then Sn 
and tn automatically satisfy (43), so we only have to consider the cases where 
either u = or v = 1. If u = and n^/^a„ oo, or if = 1 and n^^^Pn oo, 
then Sn and tn, as defined in (61), also satisfy condition (43). Therefore, we 
can restrict ourselves to the case an = 

0(n-V3) and pn = 0(n-i/3). 
Define = n~^/^{\ognY^^ , so that u + an <u-\- an <v — an < v — Pn- By 
means of (49), we find 

sup \fn{t)-f{t)\ 

< \ fn{u + an) - /(n + a„)| V |/^,(ii + a„,) - /(u + a„,)| 

+ |/(n + a„) - /(n + a„)|. 

By definition, |/(M + a„) -/(u + a„)| < ||/'|| \=0{n ^/'^{lognY/^), 

and according to (36) and (37), together with (5), 

fn{u + an) - f{u + an) = Op((na„)-i/2) = Op{n-^/\\ogn)^/''), 

fn{u + an) - f{u + an) = Opin'^/'^). 
Because B{t) is uniformly bounded, it follows that 

logn^ 



sup B{t)\fn{t)-f{t)\=Op 



n 



1/3 



and likewise 

logn 



sup B{t)\fn{t)-f{t)\=Op 

t&{v — an,V-j3n] 



n 



1/3 
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This means that 

sup B{t)\Ut)-f{t)\= sup B{t)\U{t)-f{t)\yRn, 

where Rn = Op((n/ log Because n + a„ and 1 — v + Un satisfy the 
conditions of Lemma 5.2, together with (60), it follows that 

sup B{t)\fn{t)-f{t)\ 

has the same limit distribution as 

sup A{c)\Un{c) - g{c)\, 

c6[/{'!'-an),/{M+an)] 

so that Theorem 2.2 follows from Corollary 3.1. □ 
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